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Abstract — [] In this paper, the bit energy requirements of 
training-based transmission over block Rayleigh fading channels 
are studied. Pilot signals are employed to obtain the minimum 
mean-square-error (MMSE) estimate of the channel fading coef- 
ficients. Energy efficiency is analyzed in the worst case scenario 
where the channel estimate is assumed to be perfect and the error 
] in the estimate is considered as another source of additive Gaussian 
noise. It is shown that bit energy requirement grows without bound 
as the SNR goes to zero, and the minimum bit energy is achieved 
at a nonzero SNR value below which one should not operate. The 
effect of the block length on both the minimum bit energy and the 
SNR value at which the minimum is achieved is investigated. Flash 
training schemes are analyzed and shown to improve the energy 
efficiency in the low-SNR regime. Energy efficiency analysis is also 
carried out when peak power constraints are imposed on pilot 
signals. 

I. Introduction 

One of the challenges of wireless systems is to estabUsh 
communication under time-varying channel conditions experi- 
enced due to mobility and changing environment. One technique 
employed in practical systems to cope with this challenge is 
to periodically send training signals to estimate the channel. 
Often, the channel estimate is considered as perfect, and trans- 
mission and reception is designed for a known channel. Due 
to its practical significance, training schemes has been studied 
extensively. Tong et al. in [1] present an overview of pilot- 
assisted wireless transmissions and discuss design issues from 
both information-theoretic and signal processing perspectives. 
The information-theoretic approach considers the optimization 
of training parameters to maximize the achievable data rates. 
For instance, Hassibi and Hochwald [2] optimized the training 
data, power, and duration in multiple-antenna wireless links 
by maximizing a lower bound on the channel capacity. The 
general theme in information-theoretic approaches (see e.g., 
[1] and references therein) is that training-based transmission 
schemes are close to being optimal at high signal-to-noise (SNR) 
values but highly suboptimal in the low-SNR regime due to poor 
channel estimates. 

Another important concern in wireless communications is 
the efficient use of limited energy resources. In systems where 
energy is at a premium, minimizing the energy cost per unit 
transmitted information will improve the efficiency. Hence, the 
energy required to reliably send one bit is a metric that can be 
adopted to measure the performance. Generally, energy-per-bit 
requirement is minimized, and hence the energy efficiency is 
maximized, if the system operates in the low-SNR regime [5], 
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[6]. Since training -based schemes perform poorly at low SNRs 
especially if the channel estimate is presumed to be perfect, 
this immediately calls into question the energy-efficiency of 
pilot-assisted systems. With this motivation, we present in 
this paper an energy-efficiency perspective on pilot-assisted 
wireless transmission schemes and seek to answer the question 
of how low should the SNR be so that the energy efficiency is 
compromised. 

II. Channel Model 

We consider Rayleigh block-fading channels where the input- 
output relationship within a block of m symbols is given by 
y = hx + n (1) 

where h ~ CAfiO,-/^) is a zero-mean circularly symmetric 
complex Gaussian random variable with variance £'{|/ip} = 
7^, and n is a zero-mean, m complex-dimensional Gaussian 
random vecto:[l with covariance matrix E{nn'} — NqI. x and 
y are the m complex-dimensional channel input and output 
vectors respectively. The input is subject to an average power 
constraint 

£;{||x|p} < mP. (2) 

It is assumed that the fading coefficients stay constant for a 
block of m symbols and have independent realizations for each 
block. It is further assumed that neither the transmitter nor the 
receiver has prior knowledge of the realizations of the fading 
coefficients. 

III. Training-Based Transmission and Reception 

We assume that pilot symbols are employed in the system to 
facilitate channel estimation at the receiver Hence, the system 
operates in two phases, namely training and data transmission. 
In the training phase, pilot symbols known at the receiver are 
sent from the transmitter and the received signal is 



yt = hxt + nt 



(3) 



where yt, Xt, and rij are /-dimensional vectors signifying the 
fact that / out of m input symbols are devoted to training. It is 
assumed that the receiver employs minimum mean-square error 
(MMSE) estimation to obtain the estimate 

h = E{h\yt} = , 4yt- (4) 



Nr 



~ CAf{d, S) is used to denote that x is a complex Gaussian random 
vector with mean -E{x} = d and covariance E{{x — d)(x — d)t} = S 
^Note that in the channel model (T), y, x, and n are column vectors. 



With this estimate, the fading coefficient can now be expressed 
as 



where 



h = h + h 



where 



and 



7'l|xt|P+iVo 



(5) 



(6) 



(7) 



Following the training phase, the transmitter sends the (m — 
0-dimensional data vector and the receiver equipped with 
the knowledge of the channel estimate operates on the received 
signal 



Yd = hxd + hxd + Ud 
to recover the transmitted information. 



(8) 



IV. Energy Efficiency in the Worst Case Scenario 

A. Average Power Limited Case 

Our overall goal is to identify the bit energy values that 
can be attained with optimized training parameters such as the 
power and duration of pilot symbols. The least amount of energy 
required to send one information bit reliably is given b}0 



snr 



(9) 



iVo C(snr) 

where C(snr) is the channel capacity in bits/symbol. In general, 
it is difficult to obtain a closed-form expression for the capacity 
of the channel ([S). Therefore, we consider a lower bound on the 
channel capacity by assuming that 

z = hxd + nd (10) 

is a Gaussian noise vector that has a covariance of 



E{zz^ = afE{,^d^l} + Nol, 



(11) 



and is uncorrected with the input signal x^. With this assump- 
tion, the channel model becomes 



Yd = hxd + z. 



(12) 



This model is called the worst-case scenario since the channel 
estimate is assumed to be perfect, and the noise is modeled as 
Gaussian, which presents the worst case [2]. The capacity of the 
channel in ( fT2l l. which acts as a lower bound on the capacity 
of the channel in ([Sj, is achieved by a Gaussian input with 

m — 1 

where S* is the optimal fraction of the power allocated to the 
pilot symbol, i.e., — 5*mP. The optimal value is given 
by 



5* = V^(ry + 1) - 



(13) 



*Note that is the bit energy normalized by the noise power spectral level 
Nq. 



mSNR 



1) 



77 



and SNR 



^2p 



(14) 



m(m - 2)SNR " A^o 

Note that SNR in (fl4] l is the received signal-to-noise ratio. In 
the average power limited case, sending a single pilot is optimal 
because instead of increasing the number of pilot symbols, a 
single pilot with higher power can be used and a decrease in 
the duration of the data transmission can be avoided. Hence, the 
optimal Xrf is an (m — l)-dimensional Gaussian vector Since 
the above results are indeed special cases of those in [2], the 
details are omitted. The resulting capacity expressiorH is 

,2 



Cl(snr) 



where 



TO— 1 

TO 
TO— 1 

TO 



SNR SNR^ 



?/'(snr)snr + (to-1) 
{log (1 + /(sNR)|wp) } nats/symbol 



<?!)(snr) = (5*(1 - r)TO^ 
V'(snr) = (1 + (to - 2)(5*)to, 



(15) 

(16) 
(17) 



and w ^ CAf{0, 1). Note also that the expectation in (fTsT i is 
with respect to the random variable w. The bit energy values 
in this setting are given by 

Eb^U SNR 



Nn 



Cl(snr; 



log 2. 



(18) 



provides the least amount of normalized bit energy values 
in the worst-case scenario and also serves as an upper bound 
on the achievable bit energy levels of channel (O. It is shown 
in [4] that if the channel estimate is assumed to be perfect, and 
Gaussian codebooks designed for known channels are used, and 
scaled nearest neighbor decoding is employed at the receiver, 
then the generalized mutual information has an expression 
similar to O (see [4, Corollai-y 3.0.1]). Hence also 
gives a good indication of the energy requirements of a system 
operating in this fashion. The next result provides the asymptotic 
behavior of the bit energy as SNR decreases to zero. 

Lemma 1: The normalized bit energy (fTSl l grows without 
bound as the signal-to-noise ratio decreases to zero, i.e.. 



Eb,u 



No 



SNR 

= lim — — - 

Ci^=0 snr^oCl(snr) 



1 9 lQg2 
log 2 = —. — — = 00. 



(19) 



Proof: In the low SNR regime, we have 

TO — 1 



Cl(snr) 



TO 
TO — 1 



(/(SNR)i;{|w;|2} + o(/(SNR))) (20) 



(/(SNR) + o(/(SNR))) 



(21) 



As SNR -^0,6*^ 1/2, and hence (/)(snr) — > to^/4 and 
?/;(snr) ^ to + to(to — 2)/2. Therefore, it can easily be seen 
that 



/(snr) = 



-SNR^ + o(SNR^) 



4(to - 1) 

from which we have C'l(O) = 0. 

^Unless specified otherwise, all logarithms are to the base e. 



(22) 
□ 
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Fig. 1. Energy per bit Ei, jj/Nq vs. SNR in the worst-case scenario 



Fig. 2. Minimum energy per bit 



1^0 min ' 



VS. — in tlie worst-case scenario 



The fact that Cl decreases as SNR^ as SNR goes to zero has 
akeady been pointed out in [2]. Lemma [T] shows the impact 
of this behavior on the energy-per-bit, and indicates that it is 
extremely energy-inefficient to operate at very low SNR values. 
We further conclude that in a training-based scheme where the 
channel estimate is assumed to be perfect, the minimum energy 
per bit is achieved at a finite and nonzero SNR value. This most 
energy-efficient operating point can be obtained by numerical 
analysis. We can easily compute Cl(snr) in ( fT5] ), and hence 
the bit energy values, using the Gauss-Laguerre quadrature 
integration technique. 

Figure [T] plots the normalized bit energy curves as a function 
of SNR for block lengths of m = 3, 5, 10, 20, 50, 100, 200, 10^. 
As predicted, for each block length value, the minimum bit 
energy is achieved at nonzero SNR, and the bit energy re- 
quirement increases as SNR 0. It is been noted in [2] that 
training-based schemes, which assume the channel estimate to 
be perfect, perform poorly at very low SNR values, and the exact 
transition point below which one should not operate in this 
fashion is deemed as not clear Here, we propose the SNR level 
at which the minimum bit energy is achieved as a transition 
point since operating below this point results in higher bit 
energy requirements. Another observation from Fig. [T] is that 
the minimum bit energy decreases with increasing m and is 
achieved at a lower SNR value. The following result sheds a 
light on the asymptotic behavior of the capacity as m ^ oo. 

Theorem 1: As the block length m increases, Cl approaches 
to the capacity of the perfectly known channel, i.e., 

lim Cl(snr) = i?^{log(l + SNR|wp)}. 



Moreover, define x~ ^/m. Then 

dCL (SNR) 



Proof : We have 



dx 



-oo. 



(23) 



(24) 



lim Cl(snr) = lim £;^{log(l + /(SNR)|w|2)} (25) 

m — >-oo m— *oo 

= I lim log (1 + /(SNR)|w;|2) I (26) 

= S^{log(l + SNR|u;p)}. (27) 



holds due to integrable upper bound 
|log(l+/(SNR)|wp)| < 3SNR|it;p and the Dominated 
Convergence Theorem. (l24l l follows again from the application 
of the Dominated Convergence Theorem and the fact that the 
derivative of /(snr) with respect to x = 1/m at x = is 
-oo. □ 
The first part of Theorem [T| is not surprising and is expected 
because reference [7] has already shown that as the block length 
grows, the perfect knowledge capacity is achieved even if no 
channel estimation is performed. This result agrees with our 
observation in Fig.[T]that —1.59 dB is approached at lower SNR 
values as m increases. However, the rate of approach is very 
slow in terms of the block size, as proven in the second part of 
Theorem [T] and evidenced in Fig. |2] Due to the infinite slop^ 
observed in the figure, approaching — 1.59dB is very demanding 
in block length. 

B. Flash Training and Transmission 

One approach to improve the energy efficiency in the low SNR 
regime is to increase the peak power of the transmitted signals. 
This can be achieved by transmitting ly fraction of the time with 
power P/ V. Note that training also needs to be performed only 
V fraction of the time. This type of training and communication, 
called flash transmission scheme, is analyzed in [8] where it is 
shown that the minimum bit energy of — 1.59dB can be achieved 
if the block length m increases at a certain rate as SNR decreases. 
In the setting we consider, flash transmission scheme achieves 
the foUowing rate: 



C/L (snr, J/) i^(SNR)CL 



SNR 



(28) 



^i^(snr)^ 

where < ) < 1 is the duty cycle which in general is 
a function of the SNR. First, we show that flash transmission 
using peaky Gaussian signals does not improve the minimum 
bit energy. 

Proposition 2: For any duty cycle function 

. SNR . SNR 

mf — — > mf — — -. (29) 

SNR C/l(snr, v) snr Cl(snr) 

^Note that Theorem [T] implies that the slope of g ™NR) ^'^ ^ ~ m ~ ^ 



Proof : Note that for any SNR and i^(snr), 

SNR 



SNR 



y(SNR) 



SNR 



C/i(SNR,i/) 



SNR A 
ly(SNR) ) 



> inf 



SNR 



Cl (SNR) ~ SNR Cl (snr 



(30) 



where SNR is defined as the new SNR level. Since the inequality 
in (l30l l holds for any SNR and it also holds for the infimum 
of the left-hand side of (l30l l. and hence the result follows. □ 
We classify the duty cycle function into three categories; 



- non-flashy transmission 



SNR 



1) that satisfies limsNR^o UfsNR) 

2) that satisfies limsNR^o ^fsNR) 

SNR 



3) i^(-) that satisfies linisNR- 
stant a > 0. 



»0 ,y(SNR) 





oo 

- a for some con- 



(lash transmissio 



Next, we analyze the performance of each category of duty 
cycle functions in the low-SNR regime. 

Theorem 3: If is chosen from either Category 1 or 2, 

Eb,u 



No 



lim 



SNR 



CfL=0 



SNR^O C/l(SNR, 1/' 



■log 2 



If iy{-) is chosen from Category 3, 



b,U 



iVn 



m-lE.^{\og2il + f{a)\w\^)}- 



Proof : We first note that by Jensen's inequality, 

SNR 



CfifSNR, ly) m-1 i/(snr) , 
-J-tl < ^ L log 



SNR 



SNR 



1 + / 



j/(snr) 



def 



C(SNR, I/) 



(31) 



(32) 



(33) 



(34) 



Fig. 3. Energy per bit Ef, u /Nq vs. SNR with flash transmission 



The above upper bound is given in the proof of Theorem [T] 
Finally, ( l39l ) follows from the continuity of the logarithm. □ 
Theorem [3] shows that if the rate of the decrease of the duty 
cycle is faster or slower than SNR as SNR 0, the bit energy 
requirement still increases without bound in the low-SNR regime. 
This observation is tightly linked to the fact that the capacity 
curve Cl has a zero slope as both SNR — > and SNR oo. For 
improved performance in the low-SNR regime, it is required that 
the duty cycle scale as snr. A particularly interesting choice is 

1 



SNR 



-SNR 



First, we consider category 1. In this case, as SNR 0, 
lyfsNR) ^ shown before, the logarithm in ( [33] ) scales 



as 



SNR^ 



i/(SNR^) 

leading to 



as SNR 0, and hence C(SNR, ly) scales as ^^^g'^^^ 



lim ££ii(^< lin, C(SNR,.) = 0. (35) 

SNR-^O SNR SNR^O 

In category 2, ^^^^^-^ grows to infinity as SNR 0. Since the 

log(-) function on the right hand side of ^ increases only c. Peak Power Constraint on the Pilot 



where a* is equal to the SNR level at which the minimum bit 
energy is achieved in a non-flashy transmission scheme. Fig. |3] 
plots the normalized bit energy as a function of SNR for 
block size m = 10. The minimum bit energy is achieved at 
SNR = 0.8. For SNR < 0.8, flash transmission is employed with 
z^(snr) — 1/0.8 SNR. As observed in the figure, the minimum 
bit energy level can be maintained for lower values of SNR at 
the cost of increased peak-to-average power ratio. 



logarithmically as 



SNR 

ly(SNR) 



OO, we can easily verify that 



lim £££(^< 



lim C(SNR, v) = 0. (36) 

SNR^O 



SNR-+0 SNR 

In category 3, i/(SNR) decreases at the same rate as SNR. In this 
case, we have 



Heretofore, we have assumed that there are no peak power 
constraints imposed on either the data or pilot symbols. Recall 
that the power of the pilot symbol is given by 



6*mP = + mP) - ^ 



(40) 



lim 
SNR^O 



SNR 



lim 

n — >oo 



(37) 



where ^ = ""^^^l^^hP^" ■ We immediately observe from gOl 



^£;»{lim„^^log(l + /(J^)|«;|2)} 



Epilog {l + f{a)\w\^)} 



(38) 

(39) 

a 

(|38T l is justified by invoking the Dominated Convergence The- 
orem and noting the integrable upper bound 



log ( 1 + / ( — 

nv 



1 3 
< 3 — IwP < -IwP for n > 1. 



that the pilot power increases at least as y/m as m increases. 
For large block sizes, such an increase in the pilot power may 
be prohibitive in practical systems. Therefore, it is of interest 
to impose a peak power constraint on the pilot in the following 
form; 

\xt\^ < kP. (41) 

Since the average power is uniformly distributed over the data 
symbols, the average power of a data symbol is proportional 
to P and is at most (1 — d*)2P for any block size. Therefore, 
K can be seen as a limitation on the peak-to-average power 
ratio. Note that we will allow Gaussian signaling for data 



4 



Fig. 4. Energy per bit Ei, jj /Nq vs. SNR for block sizes of m 
50, 100, 200, 10^, 10*. The pilot peak power constraint is \xt\^ < lOP. 



1 r. 



Fig. 5. Number of pilot symbols per block vs. SNR 



TABLE I 



transmission. Hence, there are no hard peak power Hmitations 
on data signals. This approach will enable us to work with a 
closed-form capacity expression. Although Gaussian signals can 
theoretically assume large values, the probability of such values 
is decreasing exponentially. 

If the optimal power allocated to a single pilot exceeds kP, 
i.e., 5*mP > kP ^ 6*m > k, the peak power constraint on 
the pilot becomes active. In this case, more than just a single 
pilot may be needed for optimal performance. 

In this section, we address the optimization of the number of 
pilot symbols when each pilot symbol has fixed power \xt^i\'^ = 
kP Vi. If the number of pilot symbols is / < m, then ||xt|p = 
IkP and, as we know from Section |lll] 



/i - CA/" 0, 



and /i - C7V 0, 



^^kP + Na 



Similarly as before, when the estimate error is assumed to be 
another source of additive noise and overall additive noise is 
assumed to be Gaussian, the input-output mutual information 
achieved by Gaussian signaling is given by 

777/ — / 

Il.p £;^{log(l + g(SNR,Okn} 



(42) 



where w ~ CA/'(0, 1) and 
S((SNR, I) = 



lK{m - /k)snr^ 



{m — Ik, + (m ~ /)Zk)snr + m - T 



(43) 



The optimal value of the training duration / that maximizes p 
can be obtained through numerical optimization. Fig. |4] plots 
the normalized bit energy values ^^R log 2 obtained with 

optimal training duration for different block lengths. The peak 
power constraint imposed on a pilot symbol is < lOP. 

Fig. |5] gives the optimal number of pilot symbols per block. 
From Fig. |4l we observe that the minimum bit energy, which 
is again achieved at a nonzero value of the SNR, decreases with 
increasing block length and approaches to the fundamental limit 
of — 1.59dB. We note from Fig. |5]that the number pilot symbols 
per block increases as the block length increases or as SNR 
decreases to zero. When there are no peak constraints, S*m ^ 
m/2 as SNR 0. Hence, we need to allocate approximately half 
of the available total power mP to the single pilot signal in the 





-'^0 min 


# of pilots 


SNR 


%r . (dB) (no 

^^0 mm 
peak constraints) 


m = 50 


1.441 


1 


0. 41 


1.440 


m = 100 


0.897 


2 


0.28 


0.871 


m = 200 


0.413 


3 


0.22 


0.404 


m = 500 


^0.079 


5 


0.16 


- 0.085 


m = 10^ 


-0.375 


9 


0.12 


-0.378 


m = 10* 


-1.007 


44 


0.05 


-1.008 



low-power regime, increasing the peak-to-average power ratio. 
In the limited peak power case, this requirement is translated 
to the requirement of more pilot symbols per block at low SNR 
values. 

Table I] lists, for different values m, the minimum bit energy 
values, the required number of pilot symbols at this level, and 
the SNR at which minimum bit energy is achieved. It is again 
assumed that k — 10. The last column of the table provides 
the minimum bit energy attained when there are no peak power 
constraints on the pilot signal. As the block size increases, the 
minimum bit energy is achieved at a lower SNR value while a 
longer training duration is required. Furthermore, comparison 
with the last column indicates that the loss in minimum bit 
energy incurred by the presence of peak power constraints is 

negligible. „ 
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